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Abstract 

This paper addresses a high level spatio-temporal problem, namely "absolute orientation”, which arises in visual- 
odometry (using stereo), or registering two models created by different Structure from Motion (SFM) reconstructions. 
We compare the veiy popular method due to Horn 1 using quaternions and our own independently derive method using 
the orthonormal rotation matrix R. We also introduce a novel approach for outlier rejection using spectral clustering 2 . 

Introduction: Absolute orientation, Horn 1 , is a higher level spatio-temporal problem that arises in visual-odometery, 
computational graphics, or extending 3D models using SFM. The problem statement is simple: given two sets of 
matched points. Pi and p 2 in R 3 , find the similarity transform that transforms pi into p 2 and reject any outliers that may 
be present. By similarity transform we mean the translation, rotation and scale such that: 1 

Pi = s * p 2 * R + t (1) 

where s is the scale ambiguity (arising if two sets of points are generated from an SFM output). R is the 3x3 
orthonormal rotation matrix, and t is the translation vector. 

Several authors have already addressed this problem. Perhaps the absolute reference on absolute orientation is Horn 1 , 
where a unit quaternion (on SCf) represents the rotation. Horn, Hilden and Negahdaripour 3 too address the absolute 
orientation problem, but use the rotation matrix R. This approach is later rejected for reasons such as: 

• Rotation matrices have too many parameters - i.e. 9 instead of 4 parameters for quaternions. 

• Rotation matrices have "strange” constraints, i.e. R * R T has to be the 3x3 identity matrix, and det(R) ~ 1. 

• Lastly, quaternions are ideal for interpolative purposes compared to the rotation matrix R. 4 

The first two objections are rather mute. First, to solve for proper quaternion, Horn creates a 9x9 matrix and solves for 
variables bilinear in the elements of q. Second, a unit quaternion q has to satisfy the constraint q # q* = 1 where q* is 
the conjugate of q, and • represents quaternion multiplication; Horn addresses this using a Lagrange multiplier. 
Regarding interpolation, Michaels and Boult are quite correct. While rotation matrices do not lend themselves, to 
interpolation, transforming them to Rodrigez representation is trivial. 

In the next section we provide a fast and robust technique to address the absolute orientation problem and also 
introduce a new approach to outlier rejection based on spectral clustering 2 which has proven extremely robust. 

Mathematical Derivation: Let us assume for now, that we have already solved for translation, t, and scale, s, in 
equation 1. Now the two sets of 3D points after correction for these parameters are v and w, and we have Vj = R w*. 

We like to minimize the sum squared error: e 2 = X* j v* - R w* | 2 . Expanding e 2 we see that it is equivalent to 
maximizing : cr = 2* v 7 R w* . Adding the orthonormality constraint for R, i.e., I - R * R T = 0, where I is the identity 
matrix and using the symmetric Langrange multiplier M, we need to maximize: 

G(R) = Xi Vi T R Wi - H trace (M (I-R * R T )) (2) 

Umeyama 5 also provides a solution to this problem, adding in another Lagrange multiplier for the det(R)=l. As we 
will see shortly, our approach provides a simpler solution to the problem. 11 Differentiating G(R) with respect to matrix 
R, and using trace of scalar is the scalar, and trace(AB) = trace(BA) we can show that: 

K-MR T = fl (3) 

where K = Zj w* v, T . Using K = MR 7 and multiplying both sides by their own transpose we get K K 7 ~ M 2 . That is 
M is the symmetric square root of the matrix K K 7 Now using singular value decomposition of K = U D V T , where D 
is the diagonal matrix of singular values, M = UDU T But from equation 3, R = K' 1 M, therefore: 

R = V U T (4) 


1 We represent points by lower case bold, matrices by higher case bold, and scalars with lower case letters. 

u We received the reference to Umeyama's work after our derivation. Although outliers are not addressed in this work, the 

mathematical treatment is very elegant, and interested readers are encourage to examine this work 



Equation 4 is both simple and extremely easy to calculate. The matrix K is 3x3 and its SVD is trivial. To make sure 
the calculated R is not a reflection matrix, simply take its determinant. If the result is -L multiply the last row by -1 

Lastly, note that if we subtract the centeroid of the points Vj and w, from these points, equation (2) still holds, but the 
translation vector, t, is removed. As a result the only remaining issues are how to preprocess the points p t and p 2 to 
remove the outliers, and how to find the scale factor s. Once the scale, s, parameter is determined we can rescale the 
remaining points, subtract their respective centeroids, and find the rotation matrix R using equation 4. With the scale, s, 
and rotation matrix R in hand, finding the least square translation vector, t, using equation 1 is trivial. 

Determining the Scale and Outlier Rejection: Let Dj and D 2 be the symmetric matrices that contain the distance 
between every two points in pi and in p 2 respectively. If W is the element by element division of D 2 by D 2 except for 
the diagonals which should remain 0, then the elements of W are the scale factor, s, plus added noise. The exception 
being the rows and columns of W corresponding to outliers (i.e., mismatched points). One popular method of rejecting 
the outliers is RANSAC 6 . But using spectral clustering, it can be shown that the Eigen vector of W, e b corresponding 
to W's largest Eigen values, is almost blockwise constant. That is, for all elements of e 1 only those that correspond to 
the rows (or columns) of W containing the outliers have a value different from the rest. The remaining values of are 
almost all constant (with very minor differences due to noise). This gives us the following simple technique to reject 
the outliers: 1) construct W. 2) find the vector e 2 for W’s largest Eigen value. 3) sort ej, keeping track of the changes in 
its rows. 4) create the permutation matrix Q by rearranging the columns of the identity matrix, I, according the changes 
in the ej. 5) create the matrix X = Q T W Q. This matrix is diagonally block structured and s can easily be derived. It is 
noteworthy, that one can use RANSAC on the block diagonal elements of X matrix thus increasing the speed of 
RANSAC by filtering out obvious outliers. 

Experimental Results: Figure la shows the results of our Monte-Carlo simulations. Three techniques, Horn, our 
SVD based derivation and QR decomposition of the matrix R derived using least squares were compared. While the 
QR method was the fastest -see figure La - it also generated the most unreliable solutions. Our approach was faster 
than Horn’s technique (or its equivalents) , with the sum square error of the two methods being indistinguishable. 

Figures 1 .b and l.c show the structure of the X matrix for 40% outliers and 96% outliers! 
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Figure 1. (a) The results of Monte Carlo simulation for Horn, QR, and our SVD based method, (b) and (c) the block diagonal structure of 

the spectral clustering matrix X for 40% and 96% outliers. 
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m This is equivalent to reflecting around the Z axis and arises from the properties of minimum Eigen values. 




